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We investigate the constraining power of current and future Sunyaev-Zeldovich cluster surveys 
on the f(R) gravity model. We use a Fisher matrix approach, adopt self-calibration for the mass- 
observable scaling relation, and evaluate constraints for the SPT, Planck, SPTPol and ACTPol 
surveys. The modified gravity effects on the mass function, halo bias, matter power spectrum, and 
mass-observable relation are taken into account. We show that, relying on number counts only, 
the Planck cluster catalog is expected to reduce current upper limits by about a factor of four, to 
o fR0 = 2 x 10~ 5 (68% confidence level) while SPT, SPTPol and ACTPol yield about 3 x 10" 5 . 
Adding the cluster power spectrum further improves the constraints to <Jf R0 — 5 x 10~ 6 for Planck 
and o~f R0 = 2 x 10 - for SPTPol, pushing cluster constraints significantly beyond the limit where 
number counts have no constraining power due to the chameleon screening mechanism. Further, the 
combination of both observables breaks degeneracies, especially with the expansion history (effective 
dark energy density and equation of state). The constraints are only mildly worsened by the use of 
self-calibration but depend on the mass threshold and redshift coverage of the cluster samples. 



I. INTRODUCTION 

One of the most fascinating aspects of contemporary 
cosmology is the potential of constraining fundamental 
physics with the plethora of available data. Galaxy clus- 
ters constitute one of the major tools we can use to this 
aim. The biggest gravitationally bound objects in the 
Universe, they have formed fairly recently and several of 
their global properties such as abundance and clustering 
on large scales can be predicted accurately with theo- 
retical models (e.g., [1, 2]). For this reason, they have 
been extensively used in the past in order to constrain 
fundamental parameters such as the total matter den- 
sity and the matter power spectrum normalization [3-7]. 
When combined with other cosmological data at various 
redshifts, clusters can also be used to constrain particle 
physics and neutrino properties [8-10]. 

Given that gravity is the only relevant force in the for- 
mation of structure in the Universe on large scales, cos- 
mological observations are uniquely suited to test gravity 
on scales of Mpc, complementing Solar System tests on 
AU scales. In recent years, clusters have received consid- 
erable interest as a probe of gravity [11, 12]. Modifica- 
tions to gravity generically change the growth of large- 
scale structure (e.g., [13, 14]), and clusters at the high- 
mass tail of the mass function are especially sensitive to 
changes in the growth rate. This has been exploited in 
Schmidt et al. [15] who used a sample of X-ray clusters 
to constrain f(R) gravity. Similarly, Lombriser et al. [16] 
used an optical Sloan cluster sample. A consistency test 
of the General Relativity + smooth Dark Energy frame- 
work using clusters was done in [17]. 

Here, we focus on the f(R) model of gravity, using the 
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functional form proposed in [18]. This model produces 
acceleration without a true cosmological constant, and is 
indistinguishable from ACDM through geometric probes 
(CMB, Supernovae, H , BAO measurements). However, 
gravitational forces are modified on smaller scales. Fur- 
thermore, the model includes the chameleon screening 
mechanism which restores General Relativity in high- 
density environments. Thus, this model is able to sat- 
isfy all current constraints on gravity. Structure forma- 
tion in this modified gravity model is now understood 
on all cosmological scales: the linear regime of structure 
formation in this f(R) model has been studied in [18]. 
The non-linear structure formation was investigated us- 
ing dedicated N-body simulations in [19-22]. This allows 
for fully self-consistent constraints and forecasts to be 
made for this model. 

While cluster samples have mainly been selected in 
the optical and X-ray bands in the past, recent obser- 
vations based on the Sunyaev-Zeldovich (SZ) effect are 
starting to produce new detections [23-27]. The SZ ef- 
fect consists in CMB photons inverse-Compton scatter- 
ing off electrons in the intra-cluster medium. This pro- 
cess causes a distortion in the CMB blackbody spectrum, 
and a frequency-dependent brightness change [28] . What 
makes SZ clusters particularly interesting as cosmolog- 
ical probes is the unique, almost redshift-independent 
sensitivity for detecting clusters. As a consequence, SZ 
surveys have the potential to discover clusters at high 
redshift where optical and X-ray surveys are not very 
efficient. This new probe is receiving significant atten- 
tion because of additional data expected from ongoing SZ 
surveys like Planck, the Atacama Cosmology Telescope 
(ACT), and the South Pole Telescope (SPT) in the near 
future. 

In this paper, we explore to what extent these new 
cluster surveys are expected to constrain /(i?) models 
through cluster number counts and clustering. The paper 
is organized as follows. We begin by presenting the sur- 
veys and expected cluster samples in Sec. II. This is use- 
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ful as the modified gravity effects discussed throughout 
the paper depend sensitively on the characteristics of the 
cluster samples. In Sec. Ill we present the parametriza- 
tion of modified gravity effects on the halo abundance 
and clustering. Sec. IV details the Fisher formalism em- 
ployed here, as well as the fiducial cosmology adopted. 
The forecasted constraints are presented in Sec. V. We 
discuss our results in Sec. VI and conclude in Sec. VII. 



approach is consistent with the one in [35]. While we 
keep V 2 oo. Pc = 2 x 10~ 3 arcmin 2 as our reference mini- 
mum value for presentation of the main results, we will 
also discuss predictions for a lower mass threshold, corre- 
sponding to Y"200,p c = 10~ 3 arcmin 2 . With such threshold, 
the completeness of the S/N > 5 sample is reduced to 
about 70% and the total number of clusters is 2700. 



II. CLUSTER SURVEYS 

We will investigate the predictions for the four surveys 
described in the following. While we try to obtain as re- 
alistic survey specifications as possible, in particular for 
the mass limit as function of redshift Mi im (z), the lack of 
previous large samples of SZ clusters necessarily makes 
these quantities somewhat uncertain. In particular, the 
relation between cluster mass and SZ signal is still im- 
perfectly known (e.g. [29-32]). The final mass limits as a 
function of redshift are shown in Fig. 1, and the resulting 
expected number of clusters for each survey is shown in 
Fig. 2. 



A. The Planck Catalog 

Planck is imaging the whole sky with an unprecedented 
combination of sensitivity (AT/T ~ 2 x 10 -6 per beam 
at 100 - 217 GHz), angular resolution (5' at 217 GHz), 
and frequency coverage (30 — 857 GHz). The SZ sig- 
nal is expected to be detected from a few thousand in- 
dividual galaxy clusters. Planck will produce a cluster 
sample with median redshift ~ 0.3 (see Fig. 2, upper 
left panel). The SZ observable is the integrated Comp- 
tonization parameter Y = J y driciustcr out to a given 
radius. For Planck, a 5<r detection threshold ensuring 
high level of completeness (about 90%) corresponds to 
^200, p c > 2 x 10 _3 arcmin 2 [33], where Y 2 oo, Pc is the inte- 
grated comptonization parameter within r2oo.p c , the ra- 
dius enclosing a mean density of 200 times the critical 
density. The early release from the Planck Collabora- 
tion gives a sample of 189 high signal-to- noise SZ clusters 
with > 6cr detection. It is therefore likely that our as- 
sumed detection threshold will be eventually reached in 
future data releases. For an SZ survey, its flux limit can 
be translated into a limiting mass by using simulation- 
calibrated scaling relations [34]: 
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In order to mitigate the effect of overestimation of 
unresolved clusters at low redshift, we further restrict 
Mim,2oop c to be at least 10 14 M Q at all z. With all 
these criteria, the Planck survey is expected to detect 
~ 1000 clusters. The mass threshold we find with this 



B. SPT and SPTpol 

The SPT survey is currently observing the sky with a 
sensitivity of 18^K/arcmin 2 at 148 GHz, 218 GHz, and 
277 GHz. This survey covers f£ « 2500 square degrees 
of the southern sky (between 20h > RA > 7h, -65° < 
S < —30°) with a projected survey size and cluster mass 
limit well matched to the Stage III survey specification 
of the Dark Energy Task Force [36]. For the mass limits, 
we employ the calibrated selection function of the survey 
by [36] . This is based on simulations and used to provide 
a realistic measure of the SPT detection significance and 
mass. Disregarding the scatter in the fitting parameters 
for this relation, we use here: 



Mim,200p( z ) 

5 x 10 14 M Q /i- 
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1/1.31 
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(2) 

For the SPT survey, 
which ensure a 90% 



where £ is the detection significance, 
we take clusters detected at £ > 5 
purity level. Currently, the SPT team is setting a low 
redshift cut at z cu t = 0.3 in their released cluster sample, 
due to difficulties in reliably distinguishing low-redshift 
clusters from CMB fluctuations in single frequency ob- 
servations. Nevertheless, with upcoming multi-frequency 
observations, a lower cut z cut = 0.15 will likely be at- 
tained. We therefore apply this cut in our work. With 
this, the SPT survey is expected to detect ~ 500 clusters. 

In addition to this, we also consider the upcoming SPT 
polarization survey (hereafter SPTpol) which will have 
an increased sensitivity of 4.5^K/arcmin 2 at 150 GHz 
for a 3 year survey and sky coverage of 625 square de- 
grees. We scaled the mass limits by a factor of 3.01/5.95 
in Eq. (2) to match with the expected mass limits of 
SPTpol clusters (Benson 2011, private communication). 
We again use z cut — 0.15, resulting in a total expected 
number of ~ 1000 clusters. While these are the limits we 
use for our main results, we also discuss outcomes that 
consider a lower mass limit, corresponding to £ = 4.5 
(80% purity). With this mass limit, SPT would find 800 
clusters and SPTPol about 1400 clusters. 



C. ACTpol 

The Atacama Cosmology Telescope (ACT) has been 
observing a portion of the southern sky since 2008 con- 
sisting of two strips of the sky, each 4 degrees wide in dec- 
lination and 360 degrees around in right ascension, one 
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strip is centered at S = —5°, and the other is centered at 
5 = —55° [34]. With a sensitivity of w 35yuK/arcmin 2 , 
only about 100 clusters are expected to be detected. In- 
stead, we turn to the newly developing dual-frequency 
(150 GHz and 220 GHz) polarization sensitive receiver 
(hereafter ACTpol [37] and reference therein) to be de- 
ployed on ACT in 2013. One of the three ACTpol observ- 
ing seasons will have a wide survey covering 4000deg 2 to a 
target sensitivity of 20/iK/arcmin 2 in temperature at 150 
GHz. With the wide field, they aim to find ~ 600 clusters 
in the ACTpol survey. The survey is 90% complete above 
a limiting mass of Mii mi 200p = 5 x 1O 14 M /i -1 (Sehgal 
2011, private communication), and we therefore assume 
this as our redshift-independent mass limit for ACTpol. 
As in SPT, the ACT team also put a low redshift cut 
in their parameter determination works and we likewise 
take z cu t = 0.15 for ACTpol, resulting in a total expected 
number of ~ 500 clusters. We also present in the discus- 
sion section the results corresponding to a lower mass 
limit, Mii m) 200(5 = 4 x 10 14 Af Q /i _1 , which would result in 
a catalog of about 1000 clusters. 



III. THEORETICAL MODELING 

A. f(R) gravity 

In the f(R) model (see [38, 39] and references therein), 
the Einstcin-Hilbert action is augmented with a general 
function of the scalar curvature R [40-42], 



R + f(R) 

16ttG 
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Here and throughout c = h = 1. This theory is equiva- 
lent to a scalar-tensor theory (if the function / is nontriv- 
ial). The additional field given by fn = df/dR mediates 
an attractive force whose physical range is given by the 
Compton wavelength Ac = a^ 1 (3dfn/dR) 1 ^ 2 . On scales 
smaller than Ac, gravitational forces are increased by 
4/3, enhancing the growth of structure. 

A further important property of such models is the 
non-linear chameleon effect which shuts down the en- 
hanced forces in regions with deep gravitational poten- 
tial wells compared with the background field value, 
1*1 > \Jr{R)\ [18, 43]. This mechanism is necessary 
in order to pass Solar System tests which rule out the 
presence of a scalar field locally. Thus, Solar System 
tests constrain the amplitude of the background field to 
be less than typical cosmological potential wells today 

(~ io- 5 ). 

In this paper, we will choose the functional form intro- 
duced by Hu & Sawicki [18]: 
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a cosmological constant. Nevertheless, as R ^> /i 2 , the 
function f(R) can be approximated as 



f(R) = -2A-f m ^ 



(5) 



with fno = —2A/j 2 /Rq replacing as the second param- 
eter of the model. Here we define -Ro = R{ z — 0), so 
that fno = fn(Ro), where overbars denote the quantities 
of the background spacetime. Note that fm < implies 
fn <0 always, as required for stable cosmological evolu- 
tion. If \fno\ "C 1, the curvature scales set by A = O(Rq) 
and /i 2 differ widely and hence the R 3> /i 2 approxima- 
tion is valid today and for all times in the past. 

The background expansion history thus mimics ACDM 
with A true cosmological constant to order f R Q. 

Therefore in the limit \f R0 \ < 10~ 2 , the f(R) model 
and ACDM are essentially indistinguishable with geomet- 
ric tests. The linear growth rate is identical to that of 
ACDM on scales larger than Ac, and becomes strongly 
scale-dependent on smaller scales [18]. 

Note that we have chosen a model whose expansion his- 
tory is close to ACDM by construction. In general, there 
is sufficient freedom in the free function / to emulate any 
given expansion history [44]. Hence, below we will also 
allow the expansion history to vary, parametrized by ef- 
fective dark energy parameters wq and w a . Further, while 
we choose a specific functional form for f(R) here, it is 
straightforward to map constraints onto different func- 
tional forms (see [45] for details). In the following, for 
notational simplicity fno will always refer to the abso- 
lute value of the field amplitude today. 



B. Cluster abundance in f(R) 

Studying structure formation in f(R) gravity beyond 
linear theory is complicated by the non-linear field equa- 
tion for the scalar field Jr, the non-linearity being re- 
sponsible for the chameleon mechanism. The field equa- 
tion needs to be solved simultaneously with the evolution 
of the matter density. This has been done in the self- 
consistent N-body simulations of [19]. The abundance of 
dark matter halos (mass function) and their clustering 
(halo bias) in the f(R) simulations was studied in [21]. 

Since these simulations are very time-consuming, they 
cannot be used to exhaust the cosmological parameter 
space. Instead, we use a simple model developed in 
[21] based on spherical collapse and the peak-background 
split in order to predict the cluster abundance and their 
linear bias. 

In order to describe the effect of f(R) gravity on the 
halo mass function, we employ the Sheth-Tormen pre- 
scription for the comoving number density of halos per 
logarithmic interval in the virial mass M v , given by 



with two free parameters, A, /i 2 . Note that as R —> 
0, f(R) —> 0, and hence this model does not contain 
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where the peak threshold v — 5 c /a(M v ) and 

vf(u) = A^au 2 [l + {av 2 )- p ] exp[-av 2 /2] . (7) 

Here a(M) is the variance of the linear density field con- 
volved with a top hat of radius r that encloses M — 
47rr 3 p m /3 at the background density 
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where Pi(fc) is the linear power spectrum (either in 
ACDM or in f(R)) and W is the Fourier transform of the 
top hat window. The normalization constant A is chosen 
such that J dvf(v) — 1. The parameter values of p = 0.3, 
a = 0.75, and 5 C = 1.673 for the spherical collapse thresh- 
old have previously been shown to match simulations of 
ACDM at the 10 - 20% level. The virial mass is defined 
as the mass enclosed at the virial radius r v , at which the 
average density is A v times the mean density. We trans- 
form the virial mass to the desired overdensity criterion 
A = 500/51 m assuming a Navarro- Frenk- White [46] den- 
sity profile [47] , and assuming the mass-concentration re- 
lation of [48] (note that the rescaling depends very weakly 
on the assumed halo concentration for the values of A 
used here). We thus obtain the mass function of halos in 
the ST prescription, n^ ST \ from nf T \ 

The effects of f(R) modified gravity enter in two ways 
in this prescription: first, we use the linear power spec- 
trum for the f(R) model in Eq. (8). Second, we as- 
sume modified spherical collapse parameters which were 
obtained by rescaling the gravitational constant by 4/3 
during the collapse calculation as well as the correspond- 
ing linear growth extrapolation to obtain S c . This cor- 
responds to the case where the collapsing region is al- 
ways smaller than the Compton wavelength of the field. 
Schmidt et al. [21] showed that this case always underes- 
timates the f(R) effects on the mass function and bias, 



and hence serves as conservative model. For our fiducial 
cosmology at z = 0, we obtain GR collapse parameters 
of S c = 1.675, A v = 363, while the modified parameters 
are given by S c = 1.693, A v = 292. The Sheth-Tormen 
prescription itself does not provide a very accurate pre- 
diction for the abundance of clusters in ACDM in the 
entire rcdshift range relevant for SZ surveys. Since more 
precise parametrizations are available, we only use the 
ST prescription to predict the relative enhancement of 
the cluster abundance in f(R). Specifically, after rescal- 
ing to our adopted mass definition, we take the ratio of 
the two and multiply it by the ACDM mass function from 
Tinker et al. [1], 
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where we use the parameters given in their Appendix 
B. Note that for small field values and at high masses, 
the predicted f(R) mass function in fact becomes smaller 
than that for ACDM. Since this effect is not seen in the 
simulations, we conservatively set the mass function ratio 
to 1 whenever it is predicted to be less than 1. 

Fig. 2 shows the number of clusters as a function of 
redshift expected from the four surveys considered in 
this work (see Sec. II), and the relative deviations of the 
f(R) model from the ACDM model for different values 
of fm (dashed lines). The f(R) modifications are most 
prominent at low redshifts z < 0.4, since the changes 
in the linear power spectrum are restricted to progres- 
sively smaller scales towards higher redshifts. Further, 
for fno < 5x 10 -5 , we see the strongest effects for surveys 
with the lowest mass thresholds, in particular Planck (for 
z < 0.15) and SPTpol. This is a consequence of the 
chameleon mechanism which suppresses the mass func- 
tion enhancement above progressively lower masses as 
fuo decreases. There are negligible differences between 
f(R) and ACDM for f R0 < 3 x 10~ 5 at high halo masses. 
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FIG. 2: Upper left. The redshift distribution of clusters in the Planck (black), ACTpol (blue), SPT (green), and SPTpol 
(magenta) survey in the fiducial ACDM cosmology. Upper right, bottom left, bottom right. The fractional deviation of the 
number density between f(R) and ACDM models due to all effects of f(R) (dotted lines), Ani n effect only (dashed lines), and 
dynamical mass effect only (dot-dashed lines) evaluated at fao = 10 -4 , fno = 5 x 10 -5 , and fno = 2 x 10~ 5 respectively. For 
the largest field value, the effect on the mass function dominates the enhancement of the cluster abundance at low z, while the 
dynamical mass effect dominates at z > 0.3. The Planck and SPTpol survey have the lowest mass threshold at z < 0.2 and 
z > 0.2 respectively and hence are most sensitive to the f(R) effects for small field values. 



Hence, the mass threshold of a given survey determines 
what field values can be probed by number counts. 

Further, we have to take into account the effect of mod- 
ified gravity on the mass-observable relation. The SZ ef- 
fect is a dynamical mass measure, as the decrement Y is 
proportional to the velocity dispersion (pressure) of elec- 
trons. In modified gravity, dynamical mass estimates are 
generally different from the actual mass due to the pres- 
ence of the additional gravitational force which enters the 
virial equation. As shown in [49] , the dynamical mass is 
related to the true mass via 

M dyn = J 3/5 M, (10) 

where J is a weighted integral of the force modification 
over the object which describes the effect on the virial 
equation. In principle, J should be weighted by the SZ 
cmissivity and observational window function. However 



in the interest of simplicity, and since we are only in- 
terested in an approximate forecast, we simply weight 
the modified forces by the matter density pNFw( r ) of the 
halo, assuming an NFW profile [46]. Further, we assume 
the host halo is spherically symmetric. We then have 

_ _ | rv dr r 2 PNFwM^r) rd^ N /dr 
J rv dr r 2 Pnfw(^) r d^ N /dr 

where ^ m is the Newtonian potential of the halo, found 
by solving (see [49] for an explicit expression) 

V 2 ^ = 47tGpnfw, (12) 

and g(r) is the force modification. In order to calculate 
the force modification, we have to solve the chameleon 
field equation for an NFW halo [49]. This calculation 
is computationally expensive, so we instead use a sim- 
ple model which describes the exact results reasonably 
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well [49] ; in fact it undcrpredicts the exact result for the 
force modification, and thus is a conservative estimate. 
Specifically, 
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Here, r scr is the outermost radius at which the condi- 
tion |\Pjv| > 3|/h|/2 is met. In the large-field limit this 
condition is never met, so that r scr = and g{r) = 4/3 
throughout. Eq. (10) then yields M dyn /M = (4/3) 3 / 5 « 
1.22. For sufficiently small fields, the chameleon mecha- 
nism becomes active so that g(r) — > for r < r scr , thus 
modeling the screening of the modified force. In this case, 
Mdyn will interpolate between M and 1.22M. 

We show in Fig. 1 the mass threshold of the four cluster 
surveys in ACDM (solid) and the f(R) dynamical mass 
effect to these thresholds (dashed). Fig. 2 also shows the 
dynamical mass effect on the observed cluster abundance 
(dash-dotted lines). Note that the dynamical mass effect 
is not simply additive to the mass function enhancement, 
since the latter depends on mass as well. Due to the 
steepness of the halo mass function at the high-mass end, 
the fact that Mgz = Mdyn is larger than the true mass 
M significantly boosts the abundance of detected clusters 
above the mass threshold. The two effects of enhanced 
growth and increased Md yn both contribute to increase 
the observed cluster abundance. For z < 0.4, the mass 
function enhancement provides a significant contribution 
to the overall change in number counts, while at higher 
redshifts the increase in dynamical mass is the dominant 
effect. 



C. Halo clustering in f(R) 

In addition to the halo abundance, f(R) modified grav- 
ity also affects the clustering of halos. This effect comes 
from two sources: first, the matter power spectrum is 
enhanced on small scales by the increased gravitational 
forces. Second, the linear bias &l(M) of halos at a given 
mass M is reduced, since at a fixed mass halos are less 
rare in f(R) than in GR. The power spectrum of clusters 
of mass M is modeled as 



P h (k,z\M) = b L (M) 2 P L (k,z). 



(14) 



The halo bias is given by the peak-background split. For 
the mass function used, it is given by [2] 



b L (M) = 1 + 
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where v,a,p are defined after Eq. (6). Note that v is 
given in terms of the virial mass M v and thus, for a given 
cluster mass M, it differs in the two collapse scenarios 
because of different A v values. 

For the matter power spectrum in Eq. (14), we use the 
linear theory power spectrum for f(R) and ACDM. As 



shown in [20], this describes the non-linear power spec- 
trum at z = measured in f(R) N-body simulations 
up to scales k ~ 0.2/i/Mpc. In order to minimize the 
impact of non-linearities on the power spectrum and its 
covariance, we limit our Fisher matrix to modes with k 
less than 0.1/i/Mpc. Note that including smaller scales 
will further improve the constraints; however, a more 
sophisticated model including non- linear and/or scale- 
dependent bias, and the non-linear matter power spec- 
trum would be necessary in this case. 

Thus, the effect on the cluster power spectrum is due to 
three combined effects: enhancement of the linear power 
spectrum APi(fc), halo bias A6^(M, z), and the dynam- 
ical mass effect Md yn . Fig. 3 shows the relative devi- 
ation APh/Ph of the cluster power spectrum in f(R) 
with respect to ACDM for the Planck survey (Sec. II) 
as a function of redshift and wavenumber k. Plots for 
the other surveys investigated here show similar z— and 
k— dependences, though the amplitude of each effect de- 
pends on the survey. Here, we have assumed one mass bin 
M > A/ii m (z) and f m = 10~ 5 . Similar to dN/dz, we plot 
the total effect (upper left), and separately the effect due 
to AP^ik) (upper right), and A6i(M, z) (lower panel). 
For this field value, the dynamical mass effect is irrelevant 
since the clusters detectable by Planck are chameleon- 
screened. The departure from ACDM is mainly driven 
by A6l(M, z) which shows a strong redshift dependence, 
and only mildly affected by AP^(k) which is fe-dependent 
and only relevant on small scales. Given that the power 
spectrum is shot-noise dominated at all scales for the 
cluster samples considered, the effect on the linear halo 
bias in fact is the most important contribution to the 
f(R) constraints from the cluster power spectrum. 



IV. FISHER MATRIX FORMALISM 

The Fisher information Matrix (FM hereafter) is de- 
fined as 



d 2 ln£ 

dp a Pf) 



(16) 



where £ is the likelihood of a data set, e.g. a cluster sam- 
ple, written as a function of the parameters p a describing 
the model. The parameters p a comprise the cosmologi- 
cal model parameters as well as "nuisance" parameters 
related to the data set (e.g., mass calibration). 



A. Cosmological parameters 

Throughout this paper, we assume a spatially flat 
(Ofc = 0) cosmology. Our model comprises a total of 
seven cosmological parameters and one f(R) model pa- 
rameter which are left free to vary. The seven parameters 
and their fiducial values (in parenthesis, taken from the 
best-fit fiat ACDM model from WMAP 7yr data, BAO 
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FIG. 3: Relative deviations in the f(R) halo power spectrum from ACDM, i.e. AP^/Ph for the Planck survey, with |/ho| = 10 -5 . 
Upper left. Total deviation. Upper right. Deviation due to Ph(k) only. Lower left. Deviation due to halo bias &l only. For this 
value of /ro, the dynamical mass effect on the power spectrum is negligible and therefore we do not show it here. The redshift 
and scale dependence in the relative deviations from other cluster surveys are similar to the ones shown here. 



and Hq measurements [50]) are: baryon density parame- 
ter Q&ft, 2 (0.0245); matter density parameter us m = £l m h 2 
(0.143); dark energy density fl\ = 1 — O m (0.73); power 
spectrum normalization og (0.809); index of power spec- 
trum n s (0.963); effective dark energy equation of state 
through w(z) = wq + (1 — a)w a , with fiducial values 
wo = —1 and w a = 0. The Hubble parameter is then 
a derived parameter given by h = \Juj m / (1 — &a) = 0.73 
in the fiducial case. The f{R) modification can alter- 
nately be parametrized using the field amplitude /ro at 
z = 0, or the Compton wavelength A^o at z — (see 
Sec. IV F). Our fiducial value is fuo = Aco = 0. 

In the following, we first discuss the Fisher matrix 
for number counts and clustering of clusters, before de- 
scribing the calibration parameters and CMB priors. 
Throughout, we divide the redshift range into bins I of 
width Az = 0.02. Further, we bin clusters in logarithmic 
mass bins m of width AlnM = 0.3 from the minimum 
mass M lim (z) for each survey (Sec. II) up to a large cut- 



off mass of Mr, 



10 16 Mq. Since the mass limit varies 



with redshift, the number of mass bins thus also varies 
somewhat across the redshift range. 



B. Number counts 

The FM for the number of clusters iVj- 
l-th redshift bin and m-th mass bin is 



dN Lm dNi. 



1 



l,m 



Pa 



PP Ni. 



within the 



(17) 



where the sum over I and m runs over intervals in the 
whole redshift range z = — 1 and cluster mass range 
[M\i m (z), oo]. We can write the abundance of clusters 
expected in a survey, within a given redshift and mass 
interval, using the mass function as: 



N hm = AO A 



d 2 V 



dM 



oh 



dzdil j M[ 
din M n(M, z)p{M° h \M) 



(18) 



where Ail is the solid angle covered by the cluster sur- 
vey, lnM; im = ln7l/ii m (z;) +mAlnM, and n(M, z) is the 
mass function given in Eq. (9). Following Lima and Hu 
[51], wc take into account the intrinsic scatter in the re- 
lation between true and observed mass, as inferred from 
a given mass proxy, by the factor p(M oh \M) which is 
the probability for a given cluster mass with M of hav- 
ing an observed mass M ob . Under the assumption of 
a log-normal distribution for the intrinsic scatter, with 
variance of n M , the probability is 



p(M oh \M) 



exp[-a; 2 (M ob )] 



(19) 



A I 



where 



x(M ob ) = 



lnM ob -B M - hi M 



(20) 



M 



With these notations, we parameterize the M —M rela- 
tion, in addition to the intrinsic scatter, by a systematic 
fractional mass bias Bm- With this prescription, the final 
expression for the number count FM is: 



Here, b^g is the mass function weighted effective bias, 

J °° dM n(M, z)b L (M, z)(erfc[a; m ] - erfc[a; m+ i]) 



J °° dM n(M, z)(erfc[x m ] - erfc[x m+ i]) 



(24) 

The effective volume for mass bins m, n, wave number 
fcj, and redshift zi is given by (see App. A) 



V 



V (zi) 



= [P™"(fc 4 ,^)] 2 « m (^)««(^) (25) 
(n m P mm + l)(n n P nn + 1) 
-I- n m n n (P r 



7 nm 1(5 nm n -lN2 



where Vq{z) is the comoving volume of the redshift slice 
[zi — 0.01, zi + 0.01] covered by the given survey, and 
n m (zi) is the cluster number density for mass bin m at 
redshift z;. The effective volume gives the weight car- 
ried by each bin in the (z, k) space to the power spec- 
trum Fisher matrix, and hence quantifies the amount 
of information contained in a given redshift- and fc-bin. 
Fig. 4 shows the redshift and scale dependence of the 
effective volume for the four cluster surveys. We find 
that V c ff < 0.3 Vq for all redshifts and surveys consid- 
ered, even when not binning in mass, hence the cluster 
power spectrum is shot-noise dominated for all surveys. 
As the lower panel of Fig. 4 illustrates, Planck is most 
limited by shot noise, while SPTpol is least limited, as 
expected from their respective mass limits and coverage. 



AnAz d 2 V 



I . / it 



(21) 



2 dzdVL 

poo 

x / d In M n(M, z) (erfc[x m ] — erfc[a; m +i]) 
Jo 

where erfc(x) is the complementary error function. 
C. Power spectrum 



D. Calibration parameters 

In self-calibrating the true and observed cluster mass 
(Eq. (20)), we introduce four nuisance parameters which 
specify the magnitude and redshift-dependence of the 
fractional mass bias Bm{z) and the intrinsic scatter 
v\i-im{z). Following [51], we assume the following 
paramctrization: 



We define the FM for the power spectrum of galaxy 
clusters as 

1 v v dlnP h m "(fc,,z;) d\n.P^ n {kj,zi) 
<*/* (2^)2 2^2^ dpa dp 

v ' m,n i,i * H 

x V™ n - oS k 2 Ak (22) 

where the sum over m, n runs over mass bins, while the 
sum in I and i runs over intervals in the whole redshift 
range and wavenumber 0.01 /iMpc -1 < k < 0.1 /iMpc -1 
with Alog 10 fc = 0.017 respectively. P™ n (h,zi) is the 
cluster cross-power spectrum for mass bins m and n, cal- 
culated for the given redshift and wavenumber through 

PT(M = b^MbMPL^zO. (23) 



B M {z) = B M0 (l + z) a 

ClnM = ClnM,o(l + z)& (26) 

Therefore the four nuisance parameters are Bmo, ol 1 
cinA/.Oj and ft. A negative value for Bm corresponds to 
an underestimation of mass. The mass bias accounts for 
the possibility of a systematic offset in the calibration of 
the observable mass scaling relation. We adopt fiducial 
values of Bmo = 0, a = 0, cr\ n M = 0.1, /3 = 0. In deriv- 
ing the main results, we will not make any assumption 
on the four nuisance parameters and leave them free to 
vary. We will study the effect of assuming different priors 
on the four nuisance parameters on the f(R) constraints 
in Sec. VD . 
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FIG. 4: The dependence on redshift (top) and wavenumber 
(bottom) of the effective volume (Eq. (25)) for a single mass 
bin and each survey: Planck (black), SPT (green), SPTpol 
(magenta), and ACTpol (blue). The effective volume is a 
weak function of wavenumber k but strongly depends on the 
redshift. 



E. CMB Prior 

In the following, we present results with the Fisher 
matrix for the Planck CMB temperature power spec- 
trum Ci added to the constraints from cluster counts 
and power spectrum. We calculate the full CMB fisher 
matrix with CAMB [52] and method described in [53]. 
For the Planck experiment, we use the three frequency 
bands 100, 143 and 217 GHz, and the Ci are calcu- 
lated up to Z max = 2500. Our fiducial parameter set for 
the CMB experiment is, as described in the DETF re- 
port [54], 9 = (n s ,il b h 2 ,rt A ,rt m h 2 ,w ,A s ,T), where A s 
is the primordial amplitude of scalar perturbations and r 
is the optical depth due to reionization. After marginal- 
izing over the optical depth, we transform the Planck 
CMB fisher matrix to our cluster survey parameter set 
6' = (n s ,£li,h 2 ,Q,A,Q, m h' 2 ,wo,cr$) by using the appropri- 
ate Jacobian matrix. The CMB imposes strong prior 
on the cosmological parameters. For example, fl m .h 2 is 
known to be measured with the CMB power spectrum to 



an exquisite precision, and this helps in breaking param- 
eter degeneracies in the constraints from cluster surveys. 
As we shall see in Sec. V, the field amplitude parameter 
fno shows degeneracies with some of the cosmological 
parameters, so that the CMB prior also helps in further 
constraining fnQ. 

F. Non-Gaussian likelihood 

An inherent assumption in the Fisher matrix approach 
is that the likelihood can be approximated as Gaussian 
around its maximum; in other words, that one can do a 
reasonably accurate Taylor expansion of ln£ in all pa- 
rameters. Unfortunately, this is not the case for the 
parameter /^ , as the derivatives of the likelihood with 
respect to fjio diverge at the fiducial value Jrq = (see 
Fig. 5 in [15]). Thus, we choose the Compton wavelength 
Aco as a parameter instead of /rq, where for the f(R) 
model and fiducial cosmology considered here, 



A co « 32.53^^ Mpc. (27) 

With this choice, ln£ becomes analytic at the fiducial 
value Aco = 0. Specifically, we calculate the derivatives 
numerically as 

dln£ ln£(Ac7 )-ln£(0) 

7\ — = x ' ^ s > 

where Aco is the Compton wavelength evaluated at the 
chosen step size = Af R0 through Eq. (27), and £ 
denotes the likelihood from either dN/dz or P(k). Un- 
fortunately, the likelihood is still strongly non-Gaussian 
in the direction of Acoi an d the constraints depend on 
the step size Afno chosen to evaluate the Fisher matrix 
elements in Eq. (16). In principle, one would have to 
evaluate the full likelihood with a MCMC approach, and 
then perform a marginalization to obtain proper fore- 
casted constraints. Here, we opt instead for a simpler 
approach. We evaluate the Fisher matrix for a range of 
step sizes A/ro, and then quote the constraints for which 
tT (/fio) = Af R0 is satisfied. One can easily show that this 
gives the correct answer in the ideal case where the like- 
lihood is Gaussian in all other parameters. Note that 
while we always use Aco as parameter in the Fisher ma- 
trix, we will quote constraints in terms of fno in order to 
facilitate comparison with the literature, using Aco only 
to show parameter degeneracies. 

V. RESULTS 

We begin by discussing constraints from number 
counts (Sec. VA) and power spectrum (Sec. VB) 
separately, before moving on to combined constraints 
(Sec. V C) and the impact of external priors on the nui- 
sance parameters (Sec. VD). 
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FIG. 5: Fully marginalized 68% confidence level (CL) con- 
straints on /_ro from the number count of clusters only (using 
Planck CMB priors), as a function of the step size Afuo, for 
the surveys considered in this paper. The red dotted line in- 
dicates <7f R0 — Afuo. For a given survey, the intersection 
of this line with the predicted constraints yields the final ex- 
pected constraint (Sec. IV F). Solid (dashed) lines represent 
the case when dynamical mass is (is not) considered. The 
sharp upturn at A/ijo < 5 x 1CP 5 is due to the chameleon 
mechanism. 



A. Number counts 

As discussed in Sec. IV F, the Fisher constraints de- 
pend on the value of A/ro adopted to evaluate the nu- 
merical derivatives in the Fisher matrix. Fig. 5 shows 
the projected constraints for the different surveys as a 
function of A/rq. The sharp upturn at A/^o ~ 3x 10~ 5 
(SPT and ACTPol), Af R0 - 2 x 10~ 5 (SPTPol) and 
A/flo ~ 9 x 10~ 6 (Planck) signals the transition to the 
chameleon-screened regime, where the mass function en- 
hancement becomes negligible [21]. The shape of this 
transition depends on the mass limits of the different sur- 
veys, as more massive halos are screened for larger values 
of fm- The figure clearly shows that, with number counts 
alone, constraints cannot be tighter than <Jf R0 ~ 10~ 5 . 
Nevertheless, this still constitutes an order of magnitude 
in improvement over current constraints. It should also 
be noted that the use of the dynamical mass in the calcu- 
lations leads to a significant improvement in constraints 
in the large-field regime where the chameleon mechanism 
is not active. 

The precise constraints obtained at the intersection 
°7ro = A/flo are listed in Tab. I, along with the step size 
used for each survey. The relative constraining power of 
the different surveys can easily be interpreted by look- 
ing at AN/N shown in Fig. 2. The best survey to con- 
strain f(R) with number counts is Planck which shows 
prominent deviations in AN/N at low rcdshift, and yields 
a 68% CL constraint of af R0 = 2 x 10~ 5 . Although 
SPTpol shows significant differences in number counts 



FIG. 6: Same as Fig. 5, but from the power spectrum of 
clusters only (using Planck CMB priors). 



out to large redshifts, the relatively small survey volume 
compared to Planck limits the performance in constrain- 
ing f(R) to <7/ R0 ~ 3 x 10 -5 . It is interesting to notice 
that while their overall performance is similar, the con- 
straints leverage on clusters in almost disjoint redshift 
ranges. Therefore these surveys provide complementary 
information on f(R) constraints from number counts, 
making the overall result less susceptible to specific is- 
sues related to either low or high rcdshift clusters. An 
investigation of whether the combination of both cluster 
samples yields a significant improvement on the expected 
f(R) constraints would be worthwhile, but is beyond the 
scope of this paper. The other two surveys also present 
results highly competitive with current constraints, and 
not very different from SPTPol {a fm = 3 x 10" 5 ). A 
better investigation with a proper likelihood would be 
necessary in order to make more precise statements. 



B. Power spectrum 

Fig. 6 shows the constraints from the clustering of clus- 
ters alone as a function of step size. The constraints 
generally worsen as the step size decreases to very small 
values. This is because the likelihood around the fiducial 
model (ACDM) scales as Xco a , where a > 1, and hence 
the derivatives go to zero as the step size decreases. How- 
ever, constraints do not worsen dramatically as the step 
size crosses the chameleon threshold, because the modi- 
fication to the halo bias in f(R) persists even if the halos 
arc chameleon screened [21]. Furthermore, the deviations 
in the matter power spectrum on small scales also persist 
for field values Jro < 10~ 5 . As expected, the use of the 
dynamical mass does not affect the constraints for small 
field values where the entire cluster sample is chameleon 
screened. 

The constraints from power spectrum only are summa- 
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FIG. 7: Fully marginalized constraints on fno from the power 
spectrum of clusters only, as a function of maximum cluster 
redshift z max - A/ho = 10~ 5 was used for all values shown 
here. 



rized in the second column of Tab. I. For Planck (as well 
as marginally for SPTPol) the constraint on f^a from 
the cluster power spectrum is tighter than that from the 
abundance only. This is mainly because the power spec- 
trum retains sensitivity to f(R) effects even when the 
halos are chameleon screened. For ACTPol and SPT the 
power spectrum yields slightly less constraining power 
than number counts, as the disadvantage of not having 
all-sky coverage is not compensated by the relatively low 
mass threshold. 

In order to investigate what cluster redshift range con- 
tributes to the fno constraints, we the constraints (for 
A/fto = 10~ 5 fixed) as function of the maximum cluster 
redshift considered in Fig. 7. For surveys with mass limits 
which decrease with redshift, i.e. SPT and SPTpol, con- 
straints improve up to z max = 1, while for Planck all the 
information is derived from clusters below z m 0.3, and 
for ACT the constraining power comes from clusters be- 
low z rj 0.5. It is especially interesting to compare results 
from ACTpol and SPT, which detect a comparable num- 
ber of clusters overall but with a different redshift dis- 
tribution. Fig. 2 shows that ACTpol has a significantly 
higher number of clusters than SPT out to z m 0.5, and a 
lower mass limit out to z ~ 0.3. Yet the constraints from 
the cluster power spectrum are worse for ACTpol than 
SPT, due to the contribution from z > 0.5 clusters for 
SPT (Fig. 7) . How well each survey can realize their po- 
tential constraining power clearly depends on the precise 
Mii m (z) achieved in the final cluster sample. 

C. Combined constraints 



TABLE I: Marginalized constraints (68% confidence level) 
from the two cluster probes dN/dz and P{k), as well as the 
combination of both for the four SZ surveys. The results are 
combined with forecasted constraints from the Planck CMB. 
We also indicate the step size A/ro used for each survey and 
probe. 
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Fig. 8 shows constraints on //jo when combining both 
number counts and clustering, as a function of the step 
size AfuQ. The dependence on A/rq is similar to the case 
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FIG. 8: Combined dN/dz + P(k) 68% CL marginalized con- 
straints on fm) as a function of the step size A/ho for the 
different surveys. As in Fig. 5, the red dotted line shows the 
identity a fR0 = Af R0 . 



energy (CI a, w and w a ). Clearly, the combination of 
both observables yields a significant reduction in degen- 
eracies in all cases. The degeneracy with dark energy 
parameters also explains why the combined constraints 
on fno are slightly better for SPT than for ACTPol, even 
though the constraints from number counts and cluster- 
ing separately are very similar for the two surveys. By 
probing higher rcdshifts more effectively, SPT is able to 
better break degeneracies with dark energy parameters. 

Constraints on modified gravity show little with the 
power spectrum normalization (see Fig. 9). This is due 
to the fact that the high number of clusters detected 
allows for good characterization of the shape of the mass 
function beyond its overall normalization. Similar but 
somewhat weaker degeneracies are present for the other 
surveys. 



D. Uncertainties in scatter of mass observable 
relations 



of power spectrum-only and number counts-only con- 
straints at small and large step size respectively. Com- 
bining the two probes helps to break degeneracies and 
better constrain the nuisance parameters. As a result, 
the constraints on /rq show improvements with respect 
to those derived from power spectrum or number counts 
alone (third column in Tab. I). While Planck reaches a 
constraint of a m 5 x 1(T 6 , ACTpol, SPT and SPTPol 
achieve 2 — 3 x 10 -5 . Among the four surveys, the Planck 
survey thus yields the tightest constraints regardless of 
which cluster probe is being used. The relative merit of 
the Planck survey is due to its large area, which allows to 
detect massive clusters on the whole sky, and its ability 
to detect low redshift clusters. Fig. 2 shows that in the 
small-field regime (Jro ~ 10~ 5 ), the low redshift clusters 
drive the constraints for Planck, while low mass clusters 
do so for SPTPol. 

Up to now, we presented results with conservative mass 
limits, i.e. clusters are expected to be detected with 
S/N > 5 for all the surveys. We also examined improve- 
ments in the constraint cr/ H0 when using more optimistic 
mass limits for each survey, according to what is outlined 
in section Sec. II. For all surveys, the constraints from 
number counts only are hardly affected, since they are 
mainly set by the chameleon threshold. In each case, the 
larger cluster sample does improve the power spectrum 
constraints. However, only for Planck does this yield a 
significant improvement in the combined constraints (by 
a factor of 1.5 to 3 x 1CT 6 ), while for ACTpol, SPT, 
and SPTpol, the improvement in combined constraints 
is marginal. 

Fig. 9 illustrates the most important degeneracies 
of Xco with standard cosmological parameters for the 
Planck survey. Here, we show Xco instead of fm for 
purposes of presentation. The most prominent degenera- 
cies are with the amount and equation of state of dark 



Throughout this work, we have assumed a functional 
form for the scaling relations and then allowed the data to 
calibrate the parameters that characterize it. This pro- 
cedure is possible thanks to the large number of clusters 
that are expected to be detected in these surveys. Cur- 
rent strategies for deriving constraints from cluster sur- 
veys, however, rely on the calibration of scaling relations 
as obtained by a small subset of well studied clusters. 
In general, allowing more freedom to the scaling relation 
parameters may avoid biases induced by incorrect scaling 
relations but can also result in a degradation of the final 
result. In order to investigate the degradation of cr/ H0 due 
to this self-calibration, we repeat the forecasts assuming 
different priors on the four "nuisance" parameters. The 
result is summarized in Tab. II for the number counts, 
clustering, and combined, and for the four surveys. 

Here, the "weak prior" case assumes priors on the nui- 
sance parameters of ABm,o = 0.05 and Aa — 1, as well 
as AtJAifl = 0.1 and A(3 = 1, as suggested by compari- 
son between X-ray and lensing cluster mass measurement 
(e.g., the XMM- Newton measurements presented in [55]). 
The combined constraints on Jro are smaller than those 
for the default, no prior case, by about 25% for Planck, 
80% for ACTpol, and 50% for SPT and SPTpol. The 
most prominent improvements are seen in number counts 
only constraints (e.g. a factor 3.8 for SPT). 

The "strong prior" case assumes that all four nuisance 
parameters are fixed at their fiducial values. This as- 
sumption, which is anyway not realistic, would lead to 
improvements of about one order of magnitude with re- 
spect to the self-calibration results. 

This result suggests that although self-calibration does 
not in general lead to major degradations in the con- 
straints, good prior information on normalization and 
scatter in the mass-observable relation can improve con- 
straints considerably in partiuclar for the ACTpol and 
SPT/SPTpol surveys. 
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FIG. 9: Joint constraints on the Compton wavelength Aco (in Mpc; see Eq. (27)) and (counterclockwise from top left) Oa, erg, 
Wo, and w a . All curves denote 68% confidence level, and are for number counts only (blue), power spectrum only (cyan), and 
combination of the two (green). The results are shown for the Planck survey with A/ko = 5 x 10 -6 . 



On the other hand, it is important to keep in mind 
that self-calibration relies on a specific paramctrization 
of the mass-observable relation and its scatter, and ex- 
ternal measurements are important to validate these as- 
sumptions. As a worst-case scenario, we also considered 
the case of a single mass bin for each survey, i.e. ne- 
glecting all mass information on individual clusters. The 
fully marginalized, combined constraints on f^a (with- 
out any priors on bias and scatter) worsen by approxi- 
mately a factor of four for Planck and ACTpol. On the 
other hand, both SPT and SPTpol constraints degrade 
by only a factor of three, since both surveys has a large 
lever arm in redshift. While these constraints are con- 
siderably worse than when using mass bins, the Planck 
and SPTpol (?) constraints with a single mass bin still 
improve over current upper limits. 



TABLE II: Relative improvement in constraints on fuo, i.e. 
a /Ho Prl ° r / a ^Ro k ' wnen including weak priors on the mass- 
observable relation (see text). In each case, Afno is that 
given in Tab. I for the corresponding survey /probe. 



Survey 


/ho(10- 


6 ) 


Aco 


(Mpc/h) 




dN/dz 


P(k) 


dN/dz 


dN/dz 


P{k) dN/dz 








+P(k) 




+P(k) 


weak 


Planck 


1.00 


1.02 


1.23 


1.02 


1.01 1.11 


ACTpol 


2.14 


1.95 


1.82 


1.01 


1.17 1.36 


SPT 


3.80 


1.02 


1.48 


1.03 


1.01 1.21 


SPTpol 


1.29 


1.02 


1.45 


1.13 


1.00 1.22 
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VI. DISCUSSION 

It is worth comparing our forecasted constraints on 
fao with those obtained in Schmidt et al. [15] (Lombriser 
ct al. [16] obtained similar constraints). By combining 49 
Chandra X-ray clusters and using geometric constraints 
from CMB, supernovae, Ho, and BAO, they found an 
upper limit of f RQ < 1.4 x 10~ 4 (95% CL), including 
only the statistical error. Our forecasted constraints are 
tighter by a factor of - 3 - 4 (ACTpol, SPT, SPTPol) 
and ~ 15 (Planck), respectively. The main reasons for 
the tighter constraints are: the significantly larger cluster 
samples yielded by these surveys, the use of the dynami- 
cal mass (which improves number count constraints) , and 
the inclusion of the clustering of clusters as an observable. 
As shown in Sec. V, the latter in fact provides the dom- 
inant constraining power for these surveys in the small 
field limit. 

Furthermore, the constraints in [15] are dominated by 
the systematic uncertainty in the cluster mass scale, and 
including this systematic increases the upper limit to 
fm i$ 3 x 10~ 4 . The constraints presented here are 
marginalized over the cluster mass scale, and hence al- 
ready include this systematic. Indeed, the combination 
of power spectrum and number counts is essential in or- 
der to realize self-calibration without loosing constraining 
power. 

One interesting finding of our study is that the 
chameleon screening mechanism, a necessary ingredient 
in this modified gravity model in order to satisfy Solar 
System constraints, has a qualitative impact on the con- 
straints. In particular, the number counts by themselves 
cannot push constraints below fm ~ 10~ 5 due to this 
effect, while they yield the tightest constraints for larger 
field values. Similarly, the importance of the dynami- 
cal mass effect is controlled by the chameleon threshold. 
This is expected to hold for other modified gravity sce- 
narios as well, as long as the respective screening mech- 
anism depends mainly on the host halo mass (or poten- 
tial well) of the cluster. On the other hand, screening 
mechanisms that mainly depend on the average interior 
density, such as the Vainshtein mechanism employed in 
braneworld and galileon models, will show a qualitatively 
different behavior [49, 56] (sec [57] for a study of the re- 
lated symmetron mechanism). For such models, the util- 
ity of number counts will not be limited to certain pa- 
rameter ranges. Thus, taking into account the screening 
mechanism is crucial for obtaining realistic constraints 
on any viable modified gravity model, both for forecasts 
and when using actual data. 

All of the surveys considered here reach the limit set by 
the chameleon mechanism on the constraints from num- 
ber counts. The Planck survey achieves the tightest con- 
straints both due to its large volume, which reduces the 
sample variance especially in the cluster power spectrum, 
and due to its ability to detect clusters at z < 0.15. 
For example, if we limit the Planck cluster sample to 
z > 0.15, the combined constraints in fm degrade by a 



factor of four to ~ 2 x 10~ 5 . We thus expect that signif- 
icant improvements in constraining power are achievable 
for ground-based SZ surveys if the minimum cluster red- 
shift can be reduced. 

Several improvements upon our treatment here are 
possible. First, our model for the f(R) effects on mass 
function and bias of halos is conservative. In order to 
investigate this, we repeated the forecast using the stan- 
dard as opposed to modified spherical collapse parame- 
ters in the model prediction [21]. In case of the Planck 
survey, the fully marginalized, combined constraint is 
tightened by a factor of 5 — 6, constraining fm to less 
than 10~ 6 . This prescription overestimates the f(R) ef- 
fects in the small field regime (fm < 10~ 5 ) and thus 
leads to overly optimistic constraints. Nevertheless, the 
improvement in constraints signals that it is worth de- 
veloping a more accurate model for the f(R) effects on 
halo mass function and bias (e.g., along the lines of [58]). 
Given the importance of the cluster power spectrum in 
the constraints, an accurate model for the modified halo 
bias will be crucial. Furthermore, a model for the cluster 
power spectrum on mildly non-linear scales would also 
lead to tighter constraints by allowing fc max to be in- 
creased above the value of 0.1ft./Mpc adopted here. 



VII. CONCLUSIONS 

The large cluster samples expected from current and 
upcoming SZ surveys can be exploited to place tight 
constraints on modifications to gravity. We have shown 
that the Planck cluster sample will allow for more than 
one order of magnitude improvement in constraints on 
the field parameter fm over current observational con- 
straints, even when marginalizing over the expansion his- 
tory (parametrized by wo,w a ) and bias and scatter in the 
mass-observable relation. Similarly, SPT, SPTPol and 
ACTPol should provide improvements of about a factor 
3-4. Using number counts only, the Planck cluster cat- 
alog should be able to reduce errors to cr/ R0 = 2 x 10~ 5 
in the near future. The inclusion of the cluster power 
spectrum as a probe greatly improves results especially 
in the small field limit. The best constraint we obtain is 
for Planck (combined constraints, <jf R0 = 5 x 10 -6 ) and is 
mainly driven by the power spectrum. These constraints 
push into the regime not ruled out by Solar System tests 
[18]. Even with self-calibration, a good understanding of 
the cluster selection function will be necessary to realize 
this potential however. On the theoretical side, a better 
description of the modified gravity effects on halo mass 
function and bias should allow for further improvements. 
In addition, the use of a proper likelihood function would 
constitute an important validation of the results obtained 
here with the Fisher matrix approximation. 
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where the factor of 1/2 in front accounts for the fact 
that the density field is real, reducing the number of in- 
dependent modes by one half. This factor is sometimes 
neglected in the literature. The volume is given by 



v{z) = n sX 2 (z) 
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Appendix A: Covariance of Cluster Power Spectra 

In this appendix we derive the Fisher matrix element 
for the cross- and auto-power spectra of clusters binned 
in mass. Let P mn (k) denote the cross-power spectrum 
between mass bins m and n. In this section we will sup- 
press the explicit rcdshift-dcpcndcncc for clarity. The 
variance of the cross-power spectrum measured in a nar- 
row k range is given by 



a 2 (P mn (k)) 



1 



N, 



mod 



p mm (k) + 



i 



p nn (k) + 



l 



+ ( P mn {k) + - — 

Tim 



(Al) 



Using this, we can derive the general power spectrum 
Fisher matrix as 



i,j l,m _ 



d In P mn {k m , Zi ) d In P mn (k m ,zi) 



dp 
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with 

V mn ' eS (ki,zi) 

vm 
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(n m P mm + l)(n n P nn + 1) + n m n n (P nm + 8 nm n m l f 



Here, n m denotes the comoving number density of clus- 
ters in mass bin i, and the number of modes is given 



where all quantities in the denominator are evaluated at 
k m and zi. This is Eq. (25). 
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